A tree augmented classifier based on Extreme Imprecise Dirichlet Model
نویسندگان
چکیده
0888-613X/$ see front matter 2010 Elsevier Inc doi:10.1016/j.ijar.2010.08.007 ⇑ Corresponding author. E-mail addresses: [email protected] (G. Corani), ca We present TANC, a TAN classifier (tree-augmented naive) based on imprecise probabilities. TANC models prior near-ignorance via the Extreme Imprecise Dirichlet Model (EDM). A first contribution of this paper is the experimental comparison between EDM and the global Imprecise Dirichlet Model using the naive credal classifier (NCC), with the aim of showing that EDM is a sensible approximation of the global IDM. TANC is able to deal with missing data in a conservative manner by considering all possible completions (without assuming them to be missing-at-random), but avoiding an exponential increase of the computational time. By experiments on real data sets, we show that TANC is more reliable than the Bayesian TAN and that it provides better performance compared to previous TANs based on imprecise probabilities. Yet, TANC is sometimes outperformed by NCC because the learned TAN structures are too complex; this calls for novel algorithms for learning the TAN structures, better suited for an imprecise probability classifier. 2010 Elsevier Inc. All rights reserved.
منابع مشابه
Credal Nets with Probabilities Estimated with an Extreme Imprecise Dirichlet Model
The propagation of probabilities in credal networks when probabilities are estimated with a global imprecise Dirichlet model is an important open problem. Only Zaffalon [21] has proposed an algorithm for the Naive classifier. The main difficulty is that, in general, computing upper and lower probability intervals implies the resolution of an optimization of a fraction of two polynomials. In the...
متن کاملRestricting the IDM for Classification
The naive credal classifier (NCC) extends naive Bayes classifier (NBC) to imprecise probabilities to robustly deal with the specification of the prior; NCC models a state of ignorance by using a set of priors, which is formalized by Walley’s Imprecise Dirichlet Model (IDM). NCC has been shown to return more robust classification than NBC. However, there are particular situations (which we preci...
متن کاملThe imprecise Dirichlet model as a basis for a new boosting classification algorithm
A new algorithm for ensemble construction based on adapted restricting a set of weights of examples in training data to avoid overfitting and to reduce a number of iterations is proposed in the paper. The algorithm called IDMBoost (Imprecise Dirichlet Model Boost) applies Walley’s imprecise Dirichlet model for modifying the restricted sets of weights depending on the number and location of clas...
متن کاملUpper entropy of credal sets. Applications to credal classification
We present an application of the measure of entropy for credal sets: as a branching criterion for constructing classification trees based on imprecise probabilities which are determined with the imprecise Dirichlet model. We also justify the use of upper entropy as a global uncertainty measure for credal sets and present a deduction of this measure. We have carried out several experiments in wh...
متن کاملThe multilabel naive credal classifier
We present a credal classifier for multilabel data. The model generalizes the naive credal classifier to the multilabel case. An imprecise-probabilistic quantification is achieved by means of the imprecise Dirichlet model in its global formulation. A polynomial-time algorithm to compute whether or not a label is optimal according to the maximality criterion is derived. Experimental results show...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Int. J. Approx. Reasoning
دوره 51 شماره
صفحات -
تاریخ انتشار 2010